The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
Abstract
Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species that can still hybridize in the wild. Next Generation Sequencing (NGS) now permits investigation of genome-wide rates of protein evolution and of the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq) to refine our understanding of the population genomics of wild sunflower species (Helianthus spp.) and of the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs) in a reference transcriptome comprising 16,312 genes. Using these SNP markers, we identified strong population clustering that largely corresponds to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis), with one distinct early-generation hybrid. We then calculated the proportion of adaptive substitutions, those fixed by selection (alpha), and identified gene ontology categories with elevated values of alpha. The "response to biotic stimulus" category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between the rate of protein evolution (dN/dS ratio) and several genomic factors predicted to co-vary with it: gene expression level, divergence and specificity; genetic divergence (FST); and nucleotide diversity (pi). We found that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This in turn suggests that these factors govern protein evolution at both microevolutionary and macroevolutionary timescales. Our results contribute to a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.
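The abstract does not state which estimator of alpha was used. As an illustration only, the sketch below assumes the widely used McDonald-Kreitman-style estimator, alpha = 1 - (Ds * Pn) / (Dn * Ps), where Dn and Ds are nonsynonymous and synonymous fixed differences between species and Pn and Ps are nonsynonymous and synonymous polymorphisms within a species. The function names and the example counts are hypothetical and are not taken from the study.

```python
def mk_alpha(dn, ds, pn, ps):
    """Estimate alpha from McDonald-Kreitman counts (assumed estimator).

    dn, ds: nonsynonymous / synonymous fixed differences (divergence)
    pn, ps: nonsynonymous / synonymous polymorphisms
    """
    if dn == 0 or ps == 0:
        raise ValueError("alpha is undefined when dn or ps is zero")
    return 1.0 - (ds * pn) / (dn * ps)


def pooled_alpha(genes):
    """Pool counts over a set of genes (e.g. one gene ontology category)
    before estimating alpha; summing first avoids undefined per-gene values.

    genes: iterable of (dn, ds, pn, ps) tuples.
    """
    genes = list(genes)
    dn = sum(g[0] for g in genes)
    ds = sum(g[1] for g in genes)
    pn = sum(g[2] for g in genes)
    ps = sum(g[3] for g in genes)
    return mk_alpha(dn, ds, pn, ps)


# Hypothetical counts: an excess of nonsynonymous divergence relative to
# polymorphism gives a positive alpha.
example = mk_alpha(dn=40, ds=100, pn=20, ps=100)

# Hypothetical category of two genes, pooled before estimation.
category = pooled_alpha([(30, 60, 10, 50), (10, 40, 10, 50)])
```

Under this estimator, a value near zero is what strict neutrality would predict (Dn/Ds equal to Pn/Ps), and negative values can arise when slightly deleterious polymorphisms inflate Pn.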
Volume: 1
Publication date: 2012